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Abstract — We consider an Adaptive Random Convolutional 
Network Coding (ARCNC) algorithm to address tlie issue of 
field size in random network coding for multicast, and study its 
memory and decoding delay performances through both analysis 
and numerical simulations. ARCNC operates as a convolutional 
code, with the coefficients of local encoding kernels chosen 
randomly over a small finite field. The cardinality of local 
encoding kernels increases with time until the global encoding 
kernel matrices at related sink nodes have full rank. ARCNC 
adapts to unknown network topologies without prior knowledge, 
by locally incrementing the dimensionality of the convolutional 
code. Because convolutional codes of different constraint lengths 
can coexist in different portions of the network, reductions in 
decoding delay and memory overheads can be achieved. We show 
that this method performs no worse than random linear network 
codes in terms of decodability, and can provide significant gains 
in terms of average decoding delay or memory in combination, 
shuttle and random geometric networks. 

Index Terms — convolutional network codes, random linear 
network codes, adaptive random convolutional network code, 
combination networks, random graphs. 



I. Introduction 

SINCE its introduction fP\, network coding has been shown 
to offer advantages in throughput, power consumption, 
and security in wireline and wireless networks. Field size 
and adaptation to unknown topologies are two of the key 
issues in network coding. Li et al. showed constructively 
that the max-flow bound is achievable by linear algebraic 
network coding (ANC) if the field is sufficiently large for a 
given deterministic multicast network ||2], while Ho et al. ||3l 
proposed a distributed random Unear network code (RLNC) 
construction that achieves the multicast capacity with proba- 
bility (1 — d/q) '', where 77 is the number of links with random 
coefficients, d is the number of sinks, and q is the field 
size. Because of its construction simplicity and the ability to 
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adapt to unknown topologies, RLNC is often preferred over 
deterministic network codes. While the construction in ||3] 
allows cycles, which leads to the creation of convolutional 
codes, it does not make use of the convolutional nature of the 
resulting codes to lighten bounds on field size, which may need 
to be large to guarantee decoding success at all sinks. Both 
block network codes (BNC) ID, ID and convolutional network 
codes (CNC) 161, Q can mitigate field size requirements. 
Medard et al. introduced the concept of BNC H; Xiao et al. 
proposed a deterministic binary BNC to solve the combination 
network problem |i8j. BNC can operate on smaller finite fields, 
but the block length may need to be pre-determined according 
to network size. In discussing cyclic networks, both Li et al. 
and Ho et al. pointed out the equivalence between ANC in 
cyclic networks with delays, and CNC lU, [Sj- Because of 
coding introduced across the temporal domain, CNC in general 
does not have a field size constraint. 

Combining the adaptive and distributive advantages of 
RLNC and the field-size independence of CNC, we proposed 
adaptive random convolutional network code (ARCNC) in 
m as a localized coding scheme for single-source multicast. 
ARCNC randomly chooses local encoding kernels from a 
small field, and the code constraint length increases locally 
at each node. In general, sinks closer to the source adopts a 
smaller code length than that of sinks far away. ARCNC adapts 
to unknown network topologies without prior knowledge, 
and allows convolutional codes with different code lengths 
to coexist in different portions of the network, leading to 
reduction in decoding delay and memory overheads associated 
with using a pre-determined field size or code length. 

Concurrently to Q, Ho et al. proposed a variable length 
CNC ifTol and provided a mathematical proof to show that 
the overall error probability of the code can be bounded when 
each intermediate node chooses its code length from a range 
estimated from its depth. The encoding process involves a 
graph transformation of the network into a "low-degree" form, 
with each node having a degree of at most 3. Our work differs 
from ifTol in that our approach uses feedbacks algorithmically. 

In this paper, we first describe the ARCNC algorithm in 
acyclic and cyclic networks and show that ARCNC converges 
in a finite amount of time with probability 1 . We then provide 
several examples to illustrate the decoding delay and memory 
gains ARCNC offers in deterministic and random networks. 
Our first example is ("J combination networks. Ngai and 
Yeung have previously pointed out that throughput gains of 



network coding can be unbounded over combination networks 
ifm . Our analysis shows that the average decoding delay is 
bounded by a constant when m is fixed and n increases. 
In other words, the decoding delay gain becomes infinite as 
the number of intermediate nodes increases in a combina- 
tion network. On the other hand, our numerical simulation 
shows that the decoding delay increases sublinearly when 
TO = n/2 and n increases in value. We then consider a 
family of networks defined as sparsified combination networks 
to illustrate the effect of interdependencies among sinks and 
depth of the network on memory use. For cyclic networks, we 
consider the shuttle network as an example. We also extend 
the application of ARCNC from structured cyclic and acyclic 
networks to random geometric graphs, where we provide 
empirical illustration of the benefits of ARCNC. 

The remainder of this paper is organized as follows: the 
ARCNC algorithm is proposed in Section [III performance 
analysis is given in Section Hill The coding delay and memory 
advantages of ARCNC are discussed for combination and 
shuttle networks in Section |IV] Numerical results are provided 
in Section[V]for combination and random networks. Section IVTl 
concludes the paper. 

II. Adaptive Randomized Convolutional Network 

Codes 

A. Basic Model and Definitions 

We model a communication network as a finite directed 
multigraph, denoted hy Q = (V,£), where V is the set of 
nodes and £ is the set of edges. An edge represents a noiseless 
communication channel with unit capacity. We consider the 
single-source multicast case, i.e., the source sends the same 
messages to all the sinks in the network. The source node 
is denoted by s, and the set of d sink nodes is denoted by 
R ~ {ri, . . . ,rd\ C V. For every node v G V, the sets of 
incoming and outgoing channels to v are In{v) and Out{v); 
let In{s) be the empty set 0. An ordered pair (e', e) of edges 
is called an adjacent pair when there exists a node v with 
e' G In{v) and e G Out{v). Since edges are directed, we use 
the terms edge and arc interchangeably in this paper 

The symbol alphabet is represented by a base field, F^. 
Assume s generates a source message per unit time, consisting 
of a fixed number of to source symbols represented by a size 
TO row vector xt = {xi,t,X2,t, ■ ■ ■ ,Xm.t), Xi,t G F,. Time t 
is indexed from 0, with the {t + l)-th message is generated at 
time t. The source messages can be collectively represented 
by a power series x[z) ~ X]t>o ^t^*' where Xt is the message 
generated at time t and z denotes a unit-time delay. x{z) is 
therefore a row vector of polynomials from the ring F^ [z] . 

Denote the data propagated over a channel e by ye{z) = 
St>o2^e,tz*, where j/e,t G is the data symbol sent on edge 
e at time t. For edges connected to the source, let j/e {t) be a 
linear function of the source messages, i.e., for all e G Out{s), 
ye{z) = x{z)fe{z), where /^(z) = Y.t>o fe,tz* is a size 
TO column vector of polynomials from Fg[z]. For edges not 
connected directly to the source, let ye{z) be a linear function 
of data transmitted on incoming adjacent edges e', i.e., for all 



1' 7^ s, e G Out{v), 

ye{z)= ^ ke'^e{z)ye'{z) ■ (1) 

Both ke'.e{z) and y e{z) are in Fg[z]. Define ke'.e{z) = 
St>o ^e'.e,tz^ as the local encoding kernel over the adjacent 
pair (e', e), where fce'.e.t S ^q- Thus, for all e € £, ye{z) is a 
linear function of the source messages, 

ye{z) = x{z)fe{z) , (2) 

where fe{z) = X]t>o/e,t-^* ^i^^ ™ column vector 

defined as the global encoding kernel over channel e, and for 

all 7J 7^ s, e G Out{v), 

fe{z)^ K'Az)fe-{z), (3) 

i.e., /e,t= Yl (H^<='.'=''-^<=''*-M ■ '^'^^ 

e'£ln{v) \i=0 / 

Note that /e^t € F^, and fe'{z),fe{z) G F^[z]. Expanding 
Eq. ([U term by term gives an explicit expression for each data 
symbol ye.t transmitted on edge e at time t, in terms of source 
symbols and global encoding kernel coefficients: 

e'6/ri(u) \i=0 ) i=0 

Each intermediate node v ^ s v& therefore required to store 
in its memory received data symbols ye'.t-i for values of i at 
which fce' .e.i is non-zero. The design of a CNC is the process 
of determining local encoding kernel coefficients fcg' e.t for 
all adjacent pairs (e',e), and /e,t for e G Out{s), such that 
the original source messages can be decoded correctly at the 
given set R of sink nodes. With a random linear code, these 
coding kernel coefficients are chosen uniformly randomly from 
the finite field F^. This paper studies an adaptive scheme 
where kernel coefficients are generated one at a time until 
decodability is achieved at all sinks. 

Collectively, we call the |/7i(w)| x \Out{v)\ matrix Ky{z) = 

{ke',e{z))e'(^In(v),e&Out{v) = Kyfl + Ky,iZ + Ky^2z'^ + ■ ■ ■ the 

local encoding kernel matrix at node v, and the to x 
matrix Fy(z) = ifeiz))e£in{v) th6 global encoding kernel 
matrix at node v. Observe from Eq. ([2) that, at sink r, Fr{z) 
is required to deconvolve the received data messages y^'. (z), 

G In{r). Therefore, each intermediate node v computes 
,fe{z) for outgoing edges from Fu{z) according to Eq.©, and 
sends fe{z) along edge e, together with data ye{z). This can 
be achieved by arranging the coefficients of fe{z) in a vector 
form and attaching them to the data. In this paper, we ignore 
the effect of this overhead transmission of coding coefficients 
on throughput or delay: we show in Section IIII-BI that the 
number of terms in /e(z) is finite, thus the overhead can be 
amortized over a long period of data transmissions. 

Moreover, Fy{z) can be written as Fv{z) — F^^ + Fy,iz + 
■ ■ ■ + Fy^tz*, where Fy^t G jg j-jjg global encoding 

kernel matrix at time t. Fy{z) can thus be viewed as a poly- 
nomial, with Fy^t as matrix coefficients. Let Ly be the degree 
of Fy (z). Ly + 1 is a direct measure of the amount of memory 



required to store F.i,{z). We shall define in Section UlI-CI the 
metric used to measure memory overhead of ARCNC. 



B. Algorithm for Acyclic Networks 

1 ) Code Generation and Data Encoding: initially, all local 
and global encoding kernels are set to 0. At time t, the (t + 1)- 
th coefficient fce',e,t of the local encoding kernel fee' e(z) is 
chosen uniformly randomly from for each adjacent pair 
(e',e), independently from other kernel coefficients. Each 
node V stores the local encoding kernels and forms the outgo- 
ing data symbol as a random linear combination of incoming 
data symbols in its memory according to Eq. (|5). Node v also 
stores the global encoding kernel matrix Fv{z) and computes 
the global encoding kernel fe{z), in the form of a vector of 
coding coefficients, according to Eq. During this code 
construction process, fe{z) is attached to the data transmitted 
on e. Once code generation terminates and the CNC Fr{z) is 
known at each sink r, fe{z) no longer needs to be forwarded, 
and only data symbols are sent on each outgoing edge. Recall 
that we ignore the reduction in rate due to the transmission of 
coding coefficients, since this overhead can be amortized over 
long periods of data transmissions. 

In acyclic networks, a complete topological order exists 
among the nodes, starting from the source. Edges can be 
ranked such that coding can be performed sequentially, where 
a downstream node encodes after all its upstream nodes have 
generated their coding coefficients. Observe that we have not 
assumed non-zero transmission delays. 

2) Testing for Decodability and Data Decoding: at every 
time instant t, each sink r decides whether its global encoding 
kernel matrix Fr{z) is full rank. If so, it sends an ACK 
signal to its parent node. An intermediate node v which has 
received ACKs from all its children at time <o will send 
an ACK to its parent, and set all subsequent local encoding 
kernel coefficients k^' ,e.t to for all t > to, e' G In{v), 
and e G Out{v). In other words, the constraint lengths of 
the local convolutional codes increase until they are sufficient 
for downstream sinks to decode successfully. Such automatic 
adaptation eliminates the need for estimating the field size or 
the constraint length a priori. It also allows nodes within the 
network to operate with different constraint lengths as needed. 

If Fr{z) is not full rank, r stores received messages and 
waits for more data to arrive. At time t, the algorithm is 
considered successful if all sinks can decode. This is equiv- 
alent to saying that the determinant of Fr{z) is a non-zero 
polynomial. Recall from Section Bl-AI Fr{z) can be written 
as Fr{z) — Fr.o + Fr.iz + - ■ ■ + Fr^tz*, where Fr.t is the global 
encoding kernel matrix at time t. Computing the determinant 
of Fr{z) at every time instant t is complex, so we test instead 
the following two conditions, introduced in lfT2l and ifTSl 
to determine decodability at a sink r. The first condition is 
necessary and easy to compute, while the second is both 
necessary and sufficient, but slightly more complex. 

1) rank(Fr.t) = m, where Fr.t = {Fr^, Fr.i,. . . , Fr.t). 



2) rank{Mr^t) — rank{Mr^t-i) = m, where 

^ Fr.O Fr.l ■ ■ ■ Fr.i 



Mr.i = 



(6) 



• • • Frfi Fr.l 
V ••• Fr,0 J 

Once Fr{z) is full rank, r can perform decoding operations. 
Let Tr be the^rsf decoding time, or the earliest time at which 
the decodability conditions are satisfied. Denote by Xq"^ and 
y^'' the row vectors (xq, • • • ,xt^) and [uq,--- ,J/t^)- Each 
source message xt is a size m row vector of source symbols 
Xi^t G generated at s at time t; each data message ijt is a 
size In{r) row vector of data symbols y^.t <= Fg received on 
the incoming edges of r at time t, e G In{r). Hence, yj^'' = 
Xq'' Mt^. To decode, we want to find a size In{r){Tr + 1) x m 
matrix D such that Mt^D — (^q") ■ '-^'^ then recover source 
message by evaluating y'^^ D = Xq''Mt^D ~ xq. Once D 
is determined, we can decode sequentially the source message 
Xt at time t + Tr, t > 0. Note that if 1/71(7") | > m, we can 
simplify the decoding process by using only m independent 
received symbols from the |/7t(r)| incoming edges. 

Observe that, an intermediate node v only stops lengthening 
its local encoding kernels ke'.e{z) when all of its downstream 
sinks achieve decodability. Thus, for a sink r with first 
decoding time Tr, the length of Fr{z) can increase even after 
Tr- Recall from Section lTl-Al that Lr is the degree of Fr (z). We 
will show in Section UlI-BI that ARCNC converges in a finite 
amount of time for a multicast connection. In other words, 
when the decodability conditions are satisfied at all sinks, 
the values of Lr and T,. at an individual sink r satisfy the 
condition Lr > Tr, where Lr is finite. Decoding of symbols 
after time Tr can be conducted sequentially. Details of the 
decoding operations can be found in 1(141 . 

3) Feedback: As we have described in the decoding sub- 
section, acknowledgments are propagated from sinks through 
intermediate nodes to the source to indicate if code length 
should continue to be increased at coding nodes. ACKs are 
assumed to be instantaneous and require no dedicated network 
links, thus incurring no additional delay or throughput costs. 
Such assumptions may be reasonable in many systems since 
feedback is only required during the code construction process. 
Once code length adaptation finishes, ACKs are no longer 
needed. We show in Section UlI-BI that ARCNC terminates in 
a finite amount of time. Therefore, the cost of feedback can 
be amortized over periods of data transmissions. 

C. Algorithm Statement for Cyclic Networks 

In an acyclic network, the local and global encoding kernel 
descriptions of a linear network code are equivalent, in the 
sense that for a given set of local encoding kernels, a set 
of global encoding kernels can be calculated recursively in 
any upstream-to-downstream order In other words, a code 
generated from local encoding kernels has a unique solution 
when decoding is performed on the corresponding global 
encoding kernels. By comparison, in a cyclic network, partial 
orderings of edges or nodes are not always consistent. Given a 
set of local encoding kernels, there may exist a unique, none. 



Fig. 1 . A sample cyclic network with edges numerically indexed. The set of 
indices is not unique, and depends on the order at which nodes are visited, 
starting from s. On the left, ri is visited before r2; on the right, r2 is visited 
before ri. In each case, (e', e) is highlighted with a curved aiTow if e' >^ e. 

or multiple sets of global encoding kernels (§3.1, ifTSll ). If 
the code is non-unique, the decoding process at a sink may 
fail. A sufficient condition for a CNC to be successful is that 
the constant coefficient matrix consisting of all local encoding 
kernels be nilpotent | fT6| ; this condition is satisfied if we code 
over an acyclic topology at i = ||T21 . In the extreme case, 
all local encoding kernels can be set to at t = 0. This setup 
translates to a unit transmission delay on each link, which as 
previous work on RLNC has shown, guarantees the uniqueness 
of a code construction ||3|. To minimize decoding delay, it 
is intuitive to make as few local encoding kernels zero as 
possible. In other words, a reasonable heuristic is to assign 
to a minimum number of ke',e.o, (e',e) G £, and to assign 
values chosen uniformly randomly from ¥q to the rest. The 
goal is to guarantee that each cycle contains at least a single 
delay. 

Although seemingly similar, this process is actually not the 
same as the problem of finding the minimal feedback edge 
set. A feedback edge set is a set containing at least one edge 
of every cycle in the graph. When a feedback edge set is 
removed, the graph becomes an acyclic directed graph. In our 
setup, however, since fce'.e,o is specific to an adjacent edge 
pair, fce'.e.o does not need to be for all e' where (e', e) G V. 

For example, a very simple but not necessarily delay- 
optimal heuristic is to index all edges, and to assign to fce'.e,o 
if e' >: e, i.e., when e' has an index larger than e; ke^e.o is 
chosen randomly from ¥g if e' -< e. Fig. [T] illustrates this 
indexing scheme. A node is considered to be visited if one of 
its incoming edges has been indexed; a node is put into a queue 
once it is visited. For each node removed from the queue, 
numerical indices are assigned to all of its outgoing edges. 
Nodes are traversed starting from the source s. The outgoing 
edges of s are therefore numbered from 1 to \Out{s)\. Note 
that the index set thus obtained is not necessarily unique. In 
this particular example, we can have two sets of edge indices, 
as shown in Fig.[T] Here ri is visited before r2 on the left, and 
vice versa on the right. In each case, an adjacent pair (e', e) 
is highlighted with a curved arrow if e' e. At < = 0, we 
set fce',e.o to for such highlighted adjacent pairs, and choose 
fce' e.o uniformly randomly from for other adjacent pairs. 

Observe that, in an acyclic network, this indexing scheme 
provides a total ordering for the nodes as well as for the 
edges: a node is visited only after all of its parents and 
ancestors are visited; an edge is indexed only after all edges 
on any of its paths from the source are indexed. In a cyclic 
network, however, an order of nodes and edges is only partial. 



(a) (b) 

Fig. 2. A convolution code resulting from self-loops on a network with 
transmission delays, (a) CNC in a delay-free network. Data transmitted on 
incoming edges are y^i (z) and ye{z) respectively. The local encoding kernels 
are given by i^cNcC-^)- (b) Equivalent ANC in a network with delays. The 
given self-loop carries a single delay z. Incoming data symbolsare y^i j and 
y^ii ,t 2t time t. The ANC coding coefficients are given by the matrix Kanc- 

with inconsistencies around each cycle. Such contradictions in 
the partial ordering of edges make the generation of unique 
network codes along each cycle impossible. By assigning 
to local encoding kernels fcg/ e.o for which e' >z e, such 
inconsistencies can be avoided at time 0, since the order of 
e' and e becomes irrelevant in determining the network code. 
After the initial step, fcg' ^ j is not necessarily for e' >: e, 
t > 0, nonetheless the convolution operations at intermediate 
nodes ensure that the coefficient inserted at f = makes 
the global encoding kernels unique at the sinks. This idea 
can be derived from the expression for f^.t given in Eq. (|4]i. 
In each cycle, there is at least one fce',e.o that is equal to 
zero. The corresponding fe'.t therefore does not contribute to 
the construction of other /e,t's in the cycle. In other words, 
the partial ordering of arcs in the cycle can be considered 
consistent at t = 1 and later times. 

Although this heuristic for cyclic networks is not optimal, 
it is universal. One disadvantage of this approach is that full 
knowledge of the topology is required at i = 0, making the 
algorithm centralized instead of entirely distributed. Nonethe- 
less, if inserting an additional transmission delay on each link 
is not an issue, we can always bypass this code assignment 
stage by zeroing all local encoding kernels at < = 0. 

After initialization, the algorithm proceeds in exactly the 
same way as in the acyclic case. 

III. Analysis 

A. Success probability 

Discussions in ||2l, |[3l, |fT6l state that in a network with de- 
lays, ANC gives rise to random processes which can be written 
algebraically in terms of a delay variable z. Thus, a convo- 
lutional code can naturally evolve from message propagation 
and linear encoding. ANC in the delay-free case is therefore 
equivalent to CNC with constraint length 1 . Similarly, using a 
CNC with constraint length Z > 1 on a delay-free network is 
equivalent to performing ANC on the same network, but with 
l — l self-loops attached to each encoding node. Each self-loop 
carries z, z^, . . . , z'^^ units of delay respectively. 

For example, in Fig. [2] we show a node with two incoming 
edges. Let the data symbol transmitted on edge e at time t be 
Ue.t- A CNC with length I = 2 is used in (a), assuming that 



transmissions are delay-free. The local encoding kernel matrix 
KcNc{z) contains two polynomials, ke'^eiiz) = fce'.ei.o + 
ke<^eiAZ and k^^^eAz) = k^,, ,.^ ^ + k^„^^^^^z. According to 
Eq. ([T]) and (|5]), the data symbol transmitted on ei at time t is 

VeiA = y^ Vedke^eifi + ye,t-lke,ei,l ■ (7) 
eE{e',e"} 

In (b), the equivalent ANC is shown. A single loop with 
a transmission delay of z has been added, and the local 
encoding kernel matrix T^anc = (fce,e)eG/r,(«),eGOut(t,) is 
constructed from coding coefficients from (a). The first col- 
umn of A'anc represents encoding coefficients from incoming 
edges e', e", e'" to the outgoing edge 62, and the second 
column represents encoding coefficients from incoming edges 
e', e", e'" to the outgoing edge e'". Using a matrix nota- 
tion, the output data symbols from v are (ye2,t Ue'" ,t) = 

{ye',t Ve",t ye"',t~l)KANC, 1-6., 

ye"',t = TJe'^tke'.e'" + Ve" ,tke" ,e- + Ve'" ,S 
— Ue' ,tke' ,ei,l + J/e" ,t ^e" ,ei ,1 
2/e2,t = ye',tke',e2 + 2/e" ,t ^e" ,62 + 2/6'" ,t-l ^e'" ,62 
= ye',tke',etfi + 2/e" ,t ^e" ,ei ,0 + Ve'" ,t-l 

= E ye,t^e,ei,0 + 2/e,t-lfce.ei,l (8) 
eg{e',e"} 

Clearly j/e^ t is equal to ye2,t- ARCNC therefore falls into 
the framework given by Ho et al. ||3], in the sense that the 
convolution process either arises naturally from cycles with 
delays, or can be considered as computed over self-loops 
appended to acyclic networks. Applying the analysis from ||3], 
we have the following theorem. 

Theorem 1: For multicast over a general network with d 
sinks, the ARCNC algorithm over can achieve a success 
probability of at least (1 - at time t, if > d, 

and T] is the number of links with random coefficients. 

Proof: At node v, k^' .e{z) at time t is a polynomial with 
maximal degree t, i.e., ke',e{z) = fce',e.o + fce'.e.i^ + • • • + 
ke',e,tz^, ke\e.i IS randomly chosen over Fg. If we group the 
coefficients, the vector k^'^e = {ke' ,e.o, ke/ .e,i, ■ ■ ■ ,ke',e.t} is 
of length t + 1, and corresponds to a random element over 
the extension field F^t+i. Using the result in |[3l, we conclude 
that the success probability of ARCNC at time t is at least 
(1 - d/(7*+i)'', as long as >(i. ■ 

We could similarly consider the analysis done by Balli et 
al. IfTTI . which states that the success probability is at least 
(1 — |J| being the number of encoding nodes, 

to show that a tighter lower bound can be given on the success 
probability of ARCNC, when (7*+^ > d. 

B. First decoding time 

As discussed in Section III-B2I we define the first decoding 
time T,. for sink r, 1 < r < d, as the time it takes r to achieve 
decodability for the first time. We had called this variable the 
stopping time in ||9l- Also recall that when all sinks are able to 
decode, at each sink r, can be smaller than L,., the degree 
of the global encoding kernel matrix Fr{z). Denote by T^r 
the time it takes for all sinks in the network to successfully 



decode, i.e., T^r ~ max{Ti, . . . ^T(i\, then T^r is also equal 
to max{Li, . . . , L^j}. The following corollary holds: 

Corollary 2: For any given < £ < 1, there exists a Tg > 
such that for any t > To, ARCNC solves the multicast problem 
with probability at least 1 — e, i.e., P{Tn > t) < e. 

Proof: Let To = [log^d - log,(l - ^1^)] - 1, then 
To+l > [logq d] since < e < 1, and {l-d/q'^'>+'^)'' > 1-e. 
Applying Theorem [T] gives P{Tn > t) < P{Tn > To) < 
1 - (1 - d/q^^+^yi < e for any t > To, ■ 

Since Pr{U°Zt[TN < t]} = l-Pr{n,^jTw > t]} > 1-e, 
Corollary 12] shows that ARCNC converges and stops in a finite 
amount of time with probability 1 for a multicast connection. 

Another relevant measure of the performance of ARCNC is 
the average first decoding time, T^yg = ^ X]r=i Observe 
that £;[Tavg] < E[Tn], where 

[log, 00 
E[Tn]= J2 PiT^N>t)+ J2 P(TN>t) 

<riog,di-i+ [i-a--)"] 
.riog,<fl-i + |:(-i,-0-f,;ji^. 

When q is large, the summation term approximates 1 — 
(1 — (i/?)'' by the binomial expansion. Hence as q increases, 
the second term above decreases to 0, while the first term 
[log^ d] — 1 is 0. i?[ravg] is therefore upper-bounded by a 
term converging to 0; it is also lower bounded by because 
at least one round of random coding is required. Therefore, 
^'[Tavg] converges to as q increases. In other words, if the 
field size is large enough, ARCNC reduces in effect to RLNC. 

Intuitively, the average first decoding time of ARCNC 
depends on the network topology. In RLNC, all nodes are 
required in code in finite fields of the same size; thus the 
effective field size is determined by the worst case sink. 
This scenario corresponds to having all nodes stop at Tat in 
ARCNC. ARCNC enables each node to decide locally what is 
a good constraint length to use, depending on side information 
from downstream nodes. Since i?[Tavg] < ^^[1^], some nodes 
may be able to decode before Tj^. The corresponding effective 
field size is therefore expected to be smaller than in RLNC. 
Two possible consequences of a smaller effective field size are 
reduced decoding delay, and reduced memory requirements. 
In Section HVl we confirm through simulations that such gains 
can be attained by ARCNC. 

C. Memory 

To measure the amount of memory required by ARCNC, 
first recall from Section III-AI that at each node u, the global 
encoding kernel matrix F^ (z), the local encoding kernel matrix 
K^(z), and past data ye'{z) on incoming arcs e' € In{v) 
need to be stored in memory. Ky{z) and ye'{z) should always 
be saved because together they generate new data symbols 
to transmit (see Eqs. ([T]i and (|5]l). Fy{z) should be saved 
during the code construction process at intermediate nodes, 
and always at sinks, since they are needed for decoding. 



Let us consider individually the three contributors to mem- 
ory use. Firstly, recall from Section III-AI that Fy{z) can be 
viewed as a polynomial in z. When all sinks are able to 
decode, at node v, Fy{z) has degree L„, with coefficients from 
jpm.x/n(D) rpj^g j^j^j ^jj^Q^jjj Qf memory needed for Fy{z) is 
therefore proportional to \\0g2 qlm I n{v){Ly + 1). Secondly, 
from Eq. dUi, we see that the length of a local encoding kernel 
polynomial ke'^e{z) should be equal to or smaller than that of 
fe{z)- Thus, the length of Ky{z) should also be equal to or 
smaller than that of Fy{z). The coefficients of Ky{z) are ele- 
ments of Fg"'"'^'^"*''"''. Hence, the amount of memory needed 
for Ky{z) is proportional to \\0g2 q]Out{v)In{v){Ly + 1). 
Lastly, a direct comparison between Eqs. (|4) and (|5]l shows 
that memory needed for y^' {z), e' € In{v) is the same for that 
needed for In practical uses of network coding, data 

can be transmitted in packets, where symbols are concatenated 
and operated upon in parallel. Packets can be very long in 
length. Nonetheless, the exact packet size is irrelevant for 
comparing memory use between different network codes, since 
all comparisons are naturally normalized to packet lengths. 

Observe that, m is the number of symbols in the source 
message, determined by the min-cut of the multicast connec- 
tion, independent of the network code used. Similarly, In{v) 
and Out{v) are attributes inherent to the network topology. 
To compare the memory use of different network codes, we 
can omit these terms, and define the average memory use of 
ARCNC by the following common factor: 



|V| 



(9) 



In RLNC, Lu = 0, and the expression simplifies to [log2 q\ , 
which is the amount of memory needed for a single finite field 
element. 

One point to keep in mind when measuring memory use is 
that even after a sink achieves decodability, its code length can 
still increase, as long as at least one of its ancestors has not 
stopped increasing code length. We say a non-source node v 
is related to a sink r if u is an ancestor of r, or if v shares an 
ancestor, other than the source, with r. Hence, Lr is dependent 
on all nodes related to r. 

D. Complexity 

To study the computation complexity of ARCNC, first 
observe that, once the adaptation process terminates, the 
computation needed for the ensuing code is no more than a 
regular CNC. In fact, the expected computation complexity 
is proportional to the average code length of ARCNC. We 
therefore omit the details of the complexity analysis of regular 
CNC here and refer interested readers to [0. 

For the adaptation process, the encoding operations are 
described by Eq. (|4|i. If the algorithm stops at time T^, the 
number of operations in the encoding steps is 0{Din |5|r^m), 
where Din = max^gy 

To determine decodability at a sink r, we check if the rank 
of F,.{z) is m. A straight-forward approach is to check whether 
its determinant is a non-zero polynomial. Alternatively, Gaus- 
sian eUmination could be applied. At time t, because Fr{z) 




Fig. 3. A combination network 



is an m x |/n(r)| matrix and each entry is a polynomial with 
degree t, the complexity of checking whether Fr{z) is full 
rank is 0{Df^^2"^mt^). Instead of computing the determinant 
or using Gaussian elimination directly, we propose to check the 
conditions given in Section III-BI For each sink r, at time t, 
determining rank Fq Fi ■ ■ ■ Ft ) requires 0{Df^mP) 
operations. If the first test passes, we calculate rank{A4t) and 
rank{Mt-i) next. Observe that rank{Mt-i) was computed 
during the last iteration. Mt is a {t+l)mx {t+l)\In{r) \ matrix 
over field ¥q. The complexity of calculating rank{Mt) by 
Gaussian elimination is 0{D^^-^m£'). The process of checking 
decodability is performed during the adaptation process only, 
hence the computation complexity here can be amortized over 
time after the coding coefficients are determined. In addition, 
as decoding occurs symbol-by-symbol, the adaptation process 
itself does not impose any additional delays. 

IV. Examples 

In this section, we describe the application of ARCNC 
in three structured networks: the combination and sparsified 
combination networks, which are acyclic, and the shuttle net- 
work, which is cyclic. For the combination network, we bound 
the expected average first decoding time; for the sparsified 
combination network, we bound the expected average memory 
requirement. In addition, the shuttle network is given as a very 
simple example to illustrate how ARCNC can be applied in 
cyclic networks. 

A. Combination Network 

A (^) combination network contains a single source s 
that multicasts m independent messages over Fg through n 
intermediate nodes to d sinks |1T5|; each sink is connected 
to a distinct set of m intermediate nodes, and d = 
Fig. |3] illustrates the topology of a combination network. 
Assuming unit capacity links, the min-cut to each sink is 
m. It can be shown that, in combination networks, routing 
is insufficient and network coding is needed to achieve the 
multicast capacity m. Here coding is performed only at s, 
since each intermediate node has only s as a parent node; an 
intermediate node simply relays to its children data from s. 
For a general ( combination network, we showed in ||9l that 
the expected average first decoding time can be significantly 
improved by ARCNC when compared to the deterministic 
BNC algorithm. We restate the results here, with details of 
the derivations included. 



At time t — I, for a sink r that has not satisfied the decod- 
abiUty conditions, Fr{z) is a size mxm matrix of polynomials 
of degree t — 1. Fr{z) has full rank with probability 



Next, we want to bound the variance of T^^yg, i.e.. 
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(10) 



Hence, the probability that sink r decodes after time t — 1 is 

" / 1 \ 
> t) = 1 - Q = 1 - [] 1 - — , t > 0. (11) 



1=1 



The expected first decoding time for sink node r is therefore 
upper and lower-bounded as follows. 
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(18) 

^ETLB{m,q). (19) 



Recall from Section IIII-BI that the expected average first 



decoding time is E[Tr,yg] = E ^ Y.r=i 



In a combination 



network, i?[ravg] is equal to i?[T,.]. Consequently, i?[Tavg] is 
upper-bounded by ETjjb, defined by Eq. ( fTSI l. ETjjb is a 
function of m and q only, independent of n. For example, if 
m = 2, q — 2, ETjjb = f • If Ti is fixed, but n increases, 
£'[Tavg] does not change. In addition, if q is large, ETjjb 
becomes 0, consistent with the general analysis in ||9]. 
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We upper-bound the terms above one by one. First, 
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(22) 

(23) 

(24) 

(25) 

(26) 
(27) 



Eq. ( |23] ) follows through organization and simplifying. 
Eq. (|24] | is obtained by replacing the terms in Eq. (l23T l with the 
upperbound of Eq. ( fTOt . We represent Eq. (l24t with binomial 
expansion and substitute with the upperbound in Eq. ( fT6] l. 
Next, let Pa = i?[rrTr'] if sinks r and r' share A parents, 
< A < m. Thus, po = E^[Tr]. When X ^ 0, given sink r 
succeeds in decoding at time ti, the probability that sink r' 
has full rank before t2 is lower-bounded as follows. 
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m — A 



(28) 



Consequently, if A 7^ 0, 

PA = E[T,Tr^] (29) 

= Y.Y. *1^2P(T, = h)P{Tr' = t2\Tr - tl) (30) 
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^ t^P{Tr = tl) ^ P(T,, > t2\Tr ^ h) (31) 
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Let = max{pi ;7B, . . . ^ Pm-i,UB}- For a sink r. Let the 
number of sinks that share at least one parent with r be A, 
then A = d - 1 - ("-™). Thus, the middle term in Eq. ^ 

A d^l_AE-^[Tr] and 




is bounded by ^puB 



war [Tavg] < 
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Depending on the relative values of n and to, we have the 
following three cases. 
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(39) 
(40) 
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(41) 



Observe from Eqs. (fTSI l and ( fT9t that all of the upper- 
bound and lower-bound constants are functions of to 
and q only. If to and q are fixed and n increases, in 
Eq. dTTT i. both and A±i approaches 0. Therefore, 
var{T) diminishes to 0. Combining this result with the 
upper-bound ETub, we can conclude that, when to is 
fixed, even if more intermediate nodes are added, a large 
proportion of the sink nodes can still be decoded within 
a small number of coding rounds. 
n = 2m, then ("-'") ^ = l-l,wd 
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(42) 



Here to and n are comparable in scale, and the bounds 
depend on the exact values of ET'^^, pjjs and ETjjb- 
We will illustrate through simulation in Section IV-AI that 
in this case, Tavg also converges to 0. 
. n < 2m, then ("-™) = 0, ^ = i _ i , and 



(ET^] 



UB 



PuB-ETls, (43) 



similar to the second case above. 

Comparing with the deterministic BNC by Xiao et al. ||8], 
we can see that, for a large combination network, with fixed 
q and to, ARCNC achieves much lower first decoding time. 
In BNC, the block length is required to be p > n — to at 
minimum; the decoding delay increases at least linearly with 
n, where as in ARCNC, the expected average first decoding 
time is independent of the value of n. On the other hand, 
with RLNC m, the multicast capacity can be achieved with 
probability (1 — d/q)"'. The exponent n is the number of 
links with random coefficients; since each intermediate node 
has the source as a single parent, coding is performed at the 
source only, and coded data are transmitted on the n outgoing 
arcs from the source. When q and to are fixed, the success 
probability of RLNC decreases exponentially in n. Thus, an 
exponential number of trials is needed to find a successful 
RLNC. Equivalently, RLNC can use an increasingly large field 
size q to maintain the same decoding probability. 

So far we have used ("J combination networks explicitly 
to illustrate the operations and the decoding delay gains of 
ARCNC. It is important to note, however, that this is a 
very restricted family of networks, in which only the source 
is required to code, and each sink shares at least 1 parent 
with other (J^) — {^^) — 1 sinks. In terms of memory, if 
sink r cannot decode, all sinks related to r are required to 
increase their memory capacity. Recall from Subsection IIII-CI 
that a non-source node v is said to be related to a sink 
r if V is an ancestor of r, or if v shares an non-source 
ancestor with r. As n becomes larger, the number of nodes 
related to r increases, especially if to increases too. Thus, 
in combination networks, we do not see considerable gains in 
terms of memory overheads when compared with BNC, unless 
TO is small. In more general networks, however, when sinks 
do not share ancestors with as many other sinks, ARCNC 
can achieve gains in terms of memory overheads as well, 
in addition to decoding delay. As an example, we define a 
sparsified combination network next. 

B. Regular sparsified Combination Network 

We define a regular sparsified combination network as a 
modified combination network, with only consecutive inter- 
mediates nodes connected to unique sink nodes. The framed 
component in Fig. |4] illustrates its structure. Source s multi- 
casts m independent messages through m intermediate nodes 
to each sink, with n — m + 1 sinks in total. This topology can 
be viewed as an abstraction of a content distribution network, 
where the source distributes data to intermediate servers, and 
clients are required to connect to I servers closest in distance 
to collect enough degrees of freedom to obtain the original 
data content. This network can be arbitrarily large in scale. 



Fig. 5. The shuttle network. Each Unk has unit capacity, s is the source; ri 
and ri are sinks each with a min-cut of 2. Edges are directed and labeled as 
ei, 1 < i < 10. Edges indices are assigned according to Section III-CI An 
adjacent pair (e', e) is labeled with a curved pointer if e' >: e. 



Fig. 4. A regular sparsified combination network (inside the dotted frame) 
with an extension. 



In a regular sparsified combination network, the number of 
other sinks related to a sink r is fixed at 2(m — 1), and is 
even smaller if r's parents are on the edge of the intermediate 
layer Thus, the average first decoding time of sinks in a regular 
sparsified combination network behaves similarly to the fixed 
TO case discussed in the previous subsection, approaching as 
n goes to infinity. 

On other hand, since now each intermediate node is con- 
nected to a fixed number of m sinks as well, when a sink 
r fails to decode and requests an increment in code length, 
a maximum of m + 2(m — 1) = 3to — 2 related nodes are 
required to increase their memory capacity. To compute Wavg 
using Eq. observe that for a sink r, assuming there are 
the maximum number of 2 (to — 1) other sinks related to r, 
the cumulative probabiUty distribution of is as follows 



Vx{Lr < t} 

<t} 

= Pr{Tr-m+l < i}P'^{Tr-rn-\-2 < ^l^r-m+l < ^} 

<t\Tr 1 < t, . . . , Tr+m-2 < t} 

= Q f 1 - - 

where Q is defined in Eq. ( fTol i. Thus, using the derivation 
from Eq. (fTTT i to (fTSI l, we have 



E[Lr] = V P{Lr > t) 




n 

i=l 



2m-2N 



<ETuB{im~2,q) 

Similarly, for an intermediate node v, we can bound i?[Lu] by 
ET{jB{2m, q). Clearly Wavg.ARCNC computed using Eq. (|9]l is a 
function of to and q only, independent of n. In other words, in 
a regular sparsified combination network, since each sink has 
a fixed number of parents, and are related to a fixed number of 
other sinks through its parents, the average amount of memory 
use across the network is independent of n. 



On the other hand, assume a field size of qji is used for 
RLNC code generation. There is a single coding node in the 
network, with n — to + 1 sinks. To guarantee an overall success 
probability larger than 1 — e, we have (1 — )^ > 1 — e 

from ifTTl . Hence 



-B[Wavg,RLNc] = \\0g2qR] > 



n-m+1 
1 - V 1 - e 



which can be very large if n is large and e is small. 

Comparing the lower-bound on £'[Wavg,RLNc] and the upper- 
bound on i?[Wavg.ARCNc], we see that the gain of ARCNC 
over RLNC in terms of memory use is infinite as n increases 
because i?[Wavg,ARCNc] is bounded by a constant value. 

An intuitive generalization of this observation is to extend 
this regular sparsified combination network by attaching an- 
other arbitrary network off one of the sinks, as shown in Fig. |4] 
Regardless of the depth of this extension from the sink r, as n 
increases, memory overheads can be significantly reduced with 
ARCNC when compared with RLNC, since most of the sinks 
and intermediate nodes are unrelated to r, thus not affected 
by the decodabiUty of sinks within the extension. 

C. Shuttle Network 

In this section, we illustrate the use of ARCNC in cyclic 
networks by applying it to a shuttle network, shown in Fig. |5] 
We do not provide a formal definition for this network, 
since its topology is given explicitly by the figure. Source 
s multicasts to sinks ri and r2- Edges Ci, 1 < i < 10, 
are directed. The edge indices have been assigned according 
to Section III-CI An adjacent pair (e', e) is labeled with a 
curved pointer if e' >: e. There are three cycles in the 
network; the left cycle is formed by 63, 65, and 67; the middle 
cycle is formed by 65, eg, ee, and eg; the right cycle is 
formed by 64, eg, and ciq. In this example, we use a field 
size q = 2. At node v, the local encoding kernel matrix 

is Ky{z) = {ke',e{z))e'£ln{v),e£Out{v) = Kvfl + Ky^iZ + 

Kv,2z'^ + . . .; each local encoding kernel is a polynomial, 

ke',e{z) = fce'.e.O + fce',e.l-Z + fce'.e,22^^ + . . .. At S, aSSUmC 

/ej (z) = (p), and (z) = ("j*), i.e., the data symbols sent out 
from s at time t are yei,t = •'J^i.t and j/g^ t = X2,t, respectively. 
The source can also linearly combine source symbols before 
transmitting on outgoing edges. 




Fig. 6. An example of local encoding kernel matrices at t = 0. For a node v, 

Kv(z) = {fce',e(^))e'e/n(u),eeOut(u) = l^v,0 + K^^iZ + Ky^2Z^ + . . .. 

For any adjacent pair (e',e) where e' >^ e, fcg' e o = 0- Each edge e is 
labeled with the data symbol y^^t it carries, e.g., )/ei,o = ^1,0- 




Fig. 7. An example of local encoding kernel matrices at t = 1. 

At f = 0, we assign to local encoding kernel coefficients 
ke'.e.o if e' >z e; and choose fce' ^.o uniformly randomly from 
F2 otherwise. One possible assignment is given in Fig.|6] Here 
we circle fcg/ e,o if e' >: e. Since q = 2, we set all other 
local encoding kernel coefficients to 1. The data messages 
transmitted on each edge at i = are then derived and labeled 
on the edge. Observe that, ri receives xi and r2 receives 
X2,o; neither is able to decode both source symbols. Hence no 
acknowledgment is sent in the network. 

At t = 1, we proceed as in the acyclic case, randomly 
choosing coefficients ke',e,i from F2. Since no acknowledg- 
ment has been sent by ri or r2, all local encoding kernels 
increase in length by 1 . One possible coding kernel coefficient 
assignment is given in Fig. Q Both vi and W3 have one 
incoming edge only and thus route instead of code, i.e., 
Kviiz) = Ky^{z) = (1 1). The other local encoding kernel 
matrices in this example are as follows 



Data symbols generated according to Eq. (|5) for this particular 
code are also labeled on the edges. For example, on edge 
65 = {v4,vi), the data symbol transmitted at t = 1 is 

2/65,1 = ^63, 0^63, 65,1 + 2/63,1^63,65,0 

+ 2/69,0^69,65,1 + 2/69,1^69,65,0 

(44) 

= xi^Q ■ 1 + (xi^o + a:i,i) • 1 + X2,o • 1 + 2/69,1 ' 
= 2:1,1 + a;2,o 

Observe that there are no logical contradictions in any of 
the three cycles. For example, in the middle cycle, on 65, 
regardless of the value of 2/69.1, the incoming data symbol at 
t = 1, 2/65,1, can be evaluated as in Eq. ( |44] |. In other words, 
in evaluating the global encoding kernel coding coefficients 
according to Eqs. (O and (|4]i, even though fpg.i is unknown, 
/e5,i can still be computed since A:eg.e5.o = 0. 

Also from Fig. |7] observe that both sinks can decode two 
source symbols at i = 1: ri can decode .ti,i and .T2,o, while r2 
can decode 2:2,1 and a;i_o- Equivalently, we can compute the 
global encoding kernel matrices and check the decodability 
conditions given in Section III-BI We omit the details here, 
but interested readers can verify using Eq. (|3) that the global 
encoding matrices are Fr-^(z) = l^, Fr^iz) — ^-fr)' ^i^^ 
the decodability conditions are indeed satisfied. Acknowledg- 
ments are sent back by both sinks to their parents, code lengths 
stop to increase, and ARCNC terminates. The first decoding 
time for both sinks is therefore T^j = T^^ = 1. 

As we have discussed in Section III-CI the deterministic 
edge indexing scheme proposed is an universal but heuristic 
way of assigning local encoding kernel coefficients at t = 0. 
In this shuttle network example, observe from Fig. |6]that in 
the middle cycle composed of edges es, eg, eg and e^, this 
scheme introduces two zero coefficients, i.e., fees, 6,3,0 = 0, and 
^69,65,0 = 0. A better code would be to allow one of these 
two coefficients to be non-zero. For example, if ke^.e^.o = 1, 
Ky^ (z) = (J) at t = 0. It can be shown in this case that the 
data symbol transmitted on eio to r2 is 2/eio,o = a^i.o + 2:2,0, 
enabling r2 to decode both source symbols at time t ~ 0. 

V. Simulations 

We have shown analytically that ARCNC converges in finite 
steps with probability 1, and that it can achieve gains in 
decoding time or memory in combination networks. In what 
follows, we want to verify these results through simulations, 
and to study numerically whether similar behaviors can be 
observed in random networks. We implemented the proposed 
encoder and decoder in MATLAB. In all instances, it can be 
observed that decoding success was achieved in a finite amount 
of time. All results plotted in this section are averaged over 
1000 runs. 

A. Combination Network 

Recall from Section BV-AI that an upper bound ETub and 
a lower bound ETlb for the average expected first decoding 
time i?[ravg] can be computed for a ( "J combination network. 
Both are functions of m and q, independent of n. In evaluating 
war [Tavg], three cases were considered, ?i > 2m, ?i — 2m, and 



Fig. 8. Average first decoding time and average memory use. m = 2, n 
increases, field size q also increases. Also plotted are the computed upper and 
lower bounds on Tavg for q = 2. 



n < 2m. When n > 2m, the number of sinks unrelated to 
a given sink r is significant. If it takes r multiple time steps 
to achieve decodability, not all other sinks and intermediate 
nodes have to continue increasing their encoding kernel length 
to accommodate r. Thus, ARCNC can offer gains in terms of 
decoding delay and memory use. We show simulation results 
below for the case when m is fixed at the value of 2, while n 
increases. By comparison, if n < 2m, there is a maximum of 
one sink related to a given sink r. We show simulation results 
below for the case of n = 2m. 

1) n > 2m, fixed m, m = 2: Fig. [8] plots the average first 
decoding time T^vg, corresponding upper and lower bounds 
ETijB, ETlb, and average memory use W^vg, as defined in 
Section UlI-CI Here m is fixed to the value of 2, n increases 
from 4 to 16, and the field size is q = 2. As discussed in 
Section IIV-AI ETub and ETlb are independent of n. As 
n increases, observe that Tavg stays approximately constant 
at about 1.3, while Wavg increases sublinearly. When n = 16, 
Wavg is approximately 6.3. On the other hand, recall from IITtI 
that a lower bound on the success probability of RLNC is (1 — 
d/{q — where | J| is the number of encoding nodes. 

In a combination network with n = 16 and m = 2, \ J\ = 
1 since only the source node codes. For a target decoding 
probability of 0.99, we have (1 - (^|) /{q - 1))^ > 0.99, thus 
q > 2.4 X 10"', and [log2 q~\ > 15. Since each encoding kernel 
contains at least one term, Wavg is lower bounded by [log2 q] • 
Hence, using ARCNC here reduces memory use by half when 
compared with RLNC. 

Fig. |8]also plots Tavg and Wavg when field size q increases 
from 2^ to 2*. As field size becomes larger, Tavg approaches 0. 
When q = 2^, the value of Tavg is close to 0.004. As discussed 
in Section HV-AI when q becomes sufficiently large, ARCNC 
terminates at t = 0, and generates the same code as RLNC. 
Also observe from this figure that as n increases from 4 to 
16, Wavg increases as well, but at different rates for different 
field sizes. Again, Wavg is lower bounded by [log2 <?] . When 
q = 2^, Wave follows an approximately linear trend, with 
an increment of less than 1 between n = 4 and n = 16. 
One explanation for this observation is that for m = 2, a 



Fig. 9. Average first decoding time and average memory use. n = 2m, n 
increases, field size q also increases. Also plotted are the computed upper and 
lower bounds on Tavg for q = 2. 

field size of q = 2^ is already sufficient for making ARCNC 
approximately the same as RLNC. 

2) n ~ 2m: Fig. |9] plots Tavg, Wavg, and corresponding 
bounds on Tavg when n = 27n, q ~ 2. Since m increases 
with n, ETjjB and ETlb change with the value of n as 
well. Observe that Tavg increases from approximately 1.27 
to approximately 1.45 as n increases from 4 to 12. In other 
words, even though more sinks are present, with each sink 
connected to more intermediate nodes, the majority of sinks 
are still able to achieve decodability within very few coding 
steps. However, since now n = 2m, any given sink r is related 
to all but one other sink; even a single sink requiring additional 
coding steps would force almost all sinks to use more memory 
to store longer encoding kernels. Compared with Fig. |9] Wavg 
appears linear in n in this case. 

Fig. |9] also plots Tavg and Wavg when q increases. Similar 
to the rn = 2 case shown in Fig. |9l Tavg approaches as 
q becomes larger Wavg appears linear in n for q < 2^, and 
piecewise linear for q = 2^. This is because Wavg is lower 
bounded by [log2 q~\ ■ When n becomes sufficiently large, this 
lower bound is surpassed, since q ^ 2^ no longer suffices in 
making all nodes decode at time 0, thus making ARCNC a 
good approximation of RLNC. 

B. Shuttle Network 

When there are cycles in the network, as discussed in 
Section III-Cl we numerically index edges, and assign local 
encoding kernels at i = according to the indices such that 
no logical contradictions exist in data transmitted around each 
cycle. Fig. [TO] plots Tavg and Wavg for the shuttle network, 
with the index assignment given in Fig. |5] As discussed in 
the example shown in Figs. |6] and [T] with this edge index 
index, both ri and r2 require at least 2 time steps to achieve 
decodability. This conclusion is verified by the plot shown in 
Figure [To] As field size q increases, Tavg converges to 1, while 
Wavg converges to 21og2(7. When q~ 2^, Tavg is 5.1. 

C. Acyclic and Cyclic Random Geometric Networks 

To see the performance of ARCNC in random networks, 
we use random geometric graphs llTSi as the network model. 
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Fig. 10. Average decoding delay and average code length for the shuttle 
network as a function of field size. 



Fig. 1 1 . Average first decoding time and average memory use in acyclic and 
cyclic random geometric graphs with 25 nodes, as a function of the number 
of receivers. Field size is g = 2^^. 



with added acyclic or cyclic constraints. In random geometric 
graphs, nodes are put into a geometrically confined area [0, 1]^, 
with coordinates chosen uniformly randomly. Nodes which are 
within a given distance are connected. Call this distance the 
connection radius. In our simulations, we set the connection 
radius to 0.4. The resulting graph is inherently bidirectional. 

For acyclic random networks, we number all nodes, with 
source as node 1, and sinks as nodes with the largest numbers. 
A node is allowed to transmit to only nodes with numbers 
larger than its own. An intermediate node on a path from the 
source to a sink can be a sink itself. To ensure the max-flow 
to each receiver is non-zero, one can choose the connection 
radius to make the graph connected with high probability; we 
fix this value to 0.4, and throw away instances where at least 
one receiver is not connected to the source. Once an acyclic 
random geometric network is generated, we use the smallest 
min-cut over all sinks as the source symbol rate, which is the 
number of source symbols generated at each time instant. 

Figs. [TT] and [12] plot the average first decoding time Tavg 
and average memory use tVavg in acyclic random geometric 
networks. Fig. [TT] shows the case where there are 25 nodes 
within the network, with more counted as sinks, while Fig. [12] 
shows the case where the number of sinks is fixed to 3, but 
more nodes are added to the network. In both cases, Tavg is 
less than 1, indicating that decodability is achieved in 2 steps 
with high probability. In Fig. [TT] the dependence of Wavg on 
the number of sinks is not very strong, since there are few 
sinks, and each node is connected to only a small portion of 
all nodes. In Fig. [12] Wava grows as the number of nodes 
increases, since on average, each node is connected to more 
neighboring nodes, thus its memory use is more likely to be 
affected by other sinks. 

To see the performance of ARCNC in cyclic random 
networks, note that random geometric graphs are inherently 
bidirectional. We apply the following modifications to make 
the network cyclic. First, we number all nodes, with source as 
node 1, and sinks as the nodes with the larges numbers. Sec- 
ond, we replace each bidirectional edge with 2 directed edges. 
Next, a directed edge from a lower numbered to a higher 
numbered node is removed from the graph with probability 



6 




qI 1 1 1 1 1 1 1 

10 15 20 25 30 35 40 45 

number of nodes 



Fig. 12. Average first decoding time and average memory use in a cyclic 
random geometric graph with 3 receivers, as a function of the total number 
of nodes in the network. Field size is 5 = 2^ . 



0.2, and a directed edge from a higher numbered to a lower 
numbered node is removed from the graph with probability 
0.8. Such edge removals ensure that not all neighboring node 
pairs form cycles, and cycles can exist with positive probabil- 
ities. We do not consider other edge removal probabilities in 
our simulations. The effect of random graph structure on the 
performance of ARCNC is a non-trivial problem and will not 
be analyzed in this paper. 

Figs.[TT]and[T2]also plot the average first decoding time and 
average memory use in cyclic random geometric networks. 
Again, in both cases, the average first decoding time Tavg 
is less than 1, indicating that decodability is achieved in 2 
steps with high probability. Wavg stays approximately constant 
when more nodes becomes sinks. On the other hand, when 
the number of sinks is fixed to 3, while more nodes are 
added to the network, W^vg first increases, then decreases in 
value. This is because as more nodes are added, since the 
connection radius stays constant at 0.4, each node is connected 
to more neighboring nodes. Sharing parents with more nodes 
first increase the memory use of a given node. However, as 
more nodes are added and more cycles form, edges are utilized 



more efficiently, thus bringing down both Tavg and Wavg- Note 
that when compared with the acycHc case, cycHc networks 
with the same number of nodes or same number of sinks 
require longer decoding time as well as more memory. This 
is expected, since with cycles, sinks are related to more nodes 
in general. 

VI. Conclusion 

We propose an adaptive random convolutional network code 
(ARCNC), operating in a small field, locally and automatically 
adapting to the network topology by incrementally growing the 
constraint length of the convolution process. Through analysis 
and simulations, we show that ARCNC performs no worse 
than random algebraic linear network codes in terms of decod- 
ability, while bringing significant gains in terms of decoding 
delay and memory use in some networks. There are three main 
advantage of ARCNC compared with scalar network codes and 
conventional convolutional network codes. Firstly, it operates 
in a small finite field, reducing the computation overheads 
of encoding and decoding operations. Secondly, it adapts to 
unknown network topologies, both acyclic and cyclic. Lastly, it 
allows codes of different constraint lengths to co-exist within 
a network, thus bringing practical gains in terms of smaller 
decoding delays and reduced memory use. The amount of 
gains achievable through ARCNC is dependent on the number 
of sinks connected to each other through mutual ancestors. In 
practical large-scale networks, the number of edges connected 
to an intermediate node or a sink is always bounded. Thus 
ARCNC could be beneficial in most general cases. 

One possible extension of this adaptive algorithm is to con- 
sider its use with other types of connections such as multiple 
multicast, or multiple unicast. Another possible direction of 
future research is to understand the impact of memory used 
during coding on the rates of innovative data flow through 
paths along the network. ARCNC presents a feasible solution 
to the multicast problem, offering gains in terms of delay and 
memory, but it is not obvious whether a constraint can be 
added to memory, while jointly optimizing rates achievable at 
sinks through this adaptive scheme. 
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